Aap?cJV(3a  tor  ptujuc  rsleci£j>5 
E^iiyuoas  Uaisjaatad 


J 


The  Probability  of  Pure  Literals 


John  W.  Rosenthal,  Department  of  Mathematics  and  Computer  Science, 
Ithaca  College,  Ithaca,  NY  14850 
E-Mail:  rosentha@ithaca.edu 

J.  M.  Plotkin,  Department  of  Mathematics,  Michigan  State  University, 

East  Lansing,  MI  48824 
E-Mail:  plotkin@math.msu.edu 

John  Franco,  Department  of  Computer  Science,  University  of  Cincinatti, 

Cincinatti,  OH  45221 

E-Mail:  franco@franco.csm.uc. edu 

Address  all  correspondence  to  J.M.  Plotkin  at  the  above  address/e-mail. 

Abstract 

We  describe  an  error  in  earlier  probabilistic  analyses  of  the  pure  literal  heuristic 
as  a  procedure  for  solving  k-SAT.  All  probabilistic  analyses  are  in  the  constant 
degree  model  in  which  a  random  instance  C  of  k-SAT  consists  of  m  clauses  selected 
independently  and  uniformly  (with  replacement)  from  the  set  of  all  fc-clauses  over 
n  variables.  We  provide  a  new  analysis  for  k  =  2.  Specifically,  we  show  with 
probability  approaching  1  as  m  goes  to  oo  one  can  apply  the  pure  literal  rule 
repeatedly  to  a  random  instance  of  2-SAT  until  the  number  of  clauses  is  “small” 
provided  n/m  >  A  >  1.  But  ifn/m<A<l,  with  probability  approaching  1  if 
the  pure  literal  rule  is  applied  as  much  as  possible,  then  at  least  clauses  will 
remain. 

Keywords:  2-SAT,  constant  degree  model,  Davis-Putnam  Procedure,  pure  literal 
(heuristic),  probability  of  a  pure  literal 
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DTIC  QUALITY  XITSPECTED  S 


1  Introduction 


The  satisfiability  problem  for  sets  of  propositional  clauses  is  the  primordial 
^P-complete  problem  (See[14,  p.38]).  Also  as  shown  in  [14],  the  satisfiability 
problem  for  sets  of  A:-clauses  [k-SAT)  is  A/’-complete  for  k  >  2.  The  lack  of 
polynomial  time  algorithms  for  AP-complete  problems  has  given  rise  to  the 
search  for  fast  heuristic  schemes  for  their  solution  which  will  be  successful 
“almost  surely”.  Franco  [10]  and  the  rest  of  us  [18]  give  probabilistic  analyses 
of  the  pure  literal  heuristic  as  a  procedure  for  solving  k-SAT.  Unfortunately 
a  serious  error  vitiates  the  outcomes.  We  describe  the  error  in  §2.  For  2-SAT 
we  provide  a  different  analysis  in  §3  showing  the  efficiency  of  the  pure  literal 
heuristic  in  the  cases  claimed  by  Franco.  This  analysis  is  based  partly  on  the 
Chvatal  and  Reed  [7]  and  Goerdt  [15]  analysis  of  the  probability  that  a  set 
of  2-clauses  is  satisfiable.  In  §4  we  show  the  optimal  nature  of  the  result  in 

§3. 

All  of  these  analyses  are  in  the  constant  degree  model  for  formulae  in 
conjunctive  normal  form  {CNF)  having  a  sequence  of  m  clauses  each  of 
which  is  a  /c-clause,  that  is,  has  a  set  of  k  literals  selected  from  a  set  V  of 
n  Boolean  variables  and  their  negations.  In  this  model  clauses  are  selected 
independently  from  one  another  with  the  uniform  distribution  on  the  set  of 
all  possible  /c-clauses,  denoted  Qk{V)  or  Qk{n).  To  allow  changes  in  param¬ 
eters  Franco  and  Pauli  [12]  denote  this  distribution  as  f{m,n.,k).  We  omit 
k  as  it  is  clear.  This  model  has  been  used  in  many  studies  of  k-SAT  includ¬ 
ing  Brown  and  Purdom  [3],  Purdom  and  Brown  [19],  Franco  and  Paull[12], 
Franco  [10],  Franco,  Plotkin,  and  Rosenthal  [13],  M-T  Chao  and  Franco  [5] 
and  [6],  Chvatal  and  Reed  [7],  and  Goerdt  [15].  It  has  also  been  used  in 
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the  study  of  k-Exact  SAT  in  Rosenthal,  Speckenmeyer,  and  Kemp  [22]  and 
Rosenthal  [21].  The  reader  should  be  cautioned  that  among  these  papers 
there  is  confusing  variation  in  the  symbols  used  for  the  number  of  variables, 
the  number  of  clauses,  and  the  number  of  literals  per  clause. 

The  pure  literal  heuristic  is  based  on  the  pure  literal  rule  which  is  part 
of  the  Davis-Putnam  Procedure  (DPP)  [8],  an  algorithm  for  satisfiability 
(SAT)  and  k-SAT.  Franco  and  Pauli  [12]  provide  a  recent  description.  [12], 
[10],  [13],  [5],  [6],  etc.  study  aspects  of  DPP  using  the  constant  degree  model. 
Goldberg  [16],  Goldberg,  Purdom,  and  Brown  [16],  Purdom  and  Brown  [20], 
Bugrara,  Pan,  and  Purdom  [4],  Franco  [11],  etc.  study  aspects  of  DPP  for 
SAT  using  another  distribution,  the  constant  density  model.  A  lot  of  the 
work  in  the  constant  density  model  involves  the  pure  literal  rule.  Recently 
Broder,  Freize,  and  Upfal  [2]  used  another  model  to  study  the  pure  literal 
rule  for  3-clauses.  In  this  model  there  are  also  m  A;-clauses,  but  the  km 
literals  are  chosen  uniformly  and  independently  from  the  set  of  2n  available 
literals. 

A  pure  literal  for  a  formula  in  CNF  is  a  literal  I  which  occurs  in  at  least 
one  clause,  but  whose  negation  does  not  occur  in  any  clause.  The  pure  literal 
rule  declares  /  to  be  true,  and  deletes  all  clauses  containing  1.  The  pure  literal 
heuristic  PL  keeps  trying  to  apply  the  pure  literal  rule  until  the  sequence  of 
remaining  clauses  is  so  small  that  the  splitting  rule  (another  one  of  DPP's 
rules)  will  efficiently  determine  satisfiability.  If  the  pure  literal  rule  can  not 
be  applied  this  often,  the  pure  literal  heuristic  gives  up.  Franco  [10]  provides 
a  more  formal  description  of  PL. 
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In  [10]  Franco  examines  the  performance  of  PL  applied  to  random  (ac¬ 
cording  to  the  constant  degree  model)  formulae  with  m  clauses  from  Qk{n) 
where  n  —  \m  and  A  >  1.  His  intention  is  to  show  that  asymptotically  (in 
m)  almost  always  PL  can  be  applied  until  only  logj  m  clauses  remain.  He 
does  not  analyze  PL  directly,  but  rather  analyzes  a  procedure  PL'  whose 
probability  of  giving  up  is  at  least  the  probability  PL  gives  up.  In  PL'  if 
a  pure  literal  occurs  in  r  >  1  clauses,  then  one  reintroduces  r  —  1  random 
A;-clauses  built  using  the  variables  not  yet  assigned  a  truth  value.  For  the 
reader’s  convenience  we  paraphrase  Franco’s  description  of  PL' . 

In  the  following  description  A  denotes  the  set  of  variables  not  yet  assigned 
a  truth  value,  D  denotes  the  sequence  of  remaining  fc-clauses,  and  E  denotes 
the  set  of  fc-clauses  that  are  reintroduced.  Initially  A  =  V ^  D  is  the  sequence 
of  fc-clauses  from  the  original  random  formulae,  and  E  =  ^ 

Procedure  PL'(D,  A,  h) 

(1)  while  \D\  >  h  do 

(2)  if  there  is  no  pure  literal  in  D,  then  “give  up” 

(3)  else  begin 

(4)  choose  a  pure  literal  I  in  D] 

(5)  delete  the  variable  of  I  from  A] 

(6)  delete  all  clauses  containing  I  from  D; 

(7)  if  r  clauses  were  removed  in  line  (6),  select  r  —  1  clauses 
independently  and  uniformly  from  Qk{A),  and  adjoin 
these  to  both  D  and  E. 

end;  (8)  return  “satisfiable”  or  “unsatisfiable”  as  determined 
by  the  splitting  rule  applied  to  D  —  E] 
end  PL' 
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2  The  Error 


Lemma  1  of  [10]  is  the  calculation  of  the  probability  that  PL'  does  not  give 
up  on  an  /(m,  n)-random  instance  (where  n  >  m).  It  is  claimed  that  after 
the  i***  iteration  of  the  while  loop  in  PL'  the  sequence  Z)  is  an  /(m  —  z,  n  —  i)- 
random  instance,  i.e.  that  the  clauses  of  D  are  independently  and  uniformly 
selected  from  QkiA).  We  show  this  is  false.  A  similar  error  occurs  in  [17] 
and  is  essentially  a  sophisticated  version  of  Bertrand’s  paradox. 

To  show  that  the  claim  is  false  we  must  first  clarify  line  (4)  of  PL'. 
The  choice  of  a  pure  literal  in  D  may  be  made  algorithmically  or  randomly 
(viewing  PL'  as  a  randomized  algorithm).  In  the  random  case  for  every 
occurrence  of  line  (4)  one  assigns  probabilities  (or  “weights”)  to  the  possible 
choices  of  pure  literal.  We  call  any  such  assignment  a  weighting  scheme.  For 
example,  we  could  assign  each  pure  literal  equal  weight.  The  algorithmic 
case  can  be  viewed  as  a  special  instance  of  the  random  case  in  which  we 
use  a  weighting  scheme  that  always  assigns  one  pure  literal  weight  1  and  the 
others  weight  0. 

We  show  that  for  no  weighting  scheme  does  one  retain  independence 
after  the  completion  of  one  pass  through  the  while  loop  of  PL'.  For  ease  of 
exposition  we  do  this  for  m  =  4,  n  =  6,  A;  =  3. 

We  represent  a  clause  by  listing  its  variables  and  beneath  each  variable 
placing  +  or  —  to  indicate  a  positive  or  negative  occurrence  of  that  variable. 

Let  A  =  {ui  I  1  <  f  <  6}  and  ej  e  {+,  -}  for  1  <  j  <  3.  For  each  se¬ 
quence  D  consisting  of  four  clauses  from  (53(A),  let  IF£)(ei ,  62, 63)  be  the  prob¬ 
ability  that  in  the  first  execution  of  the  while  loop  on  line  (1)  of  PL'{D,  A,  h) 
the  pure  literal  chosen  on  line  (4)  was  U4,U5,  or  Ve  and  that  the  new  D  arising 
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at  the  end  of  line  (7)  is 

Vi  t;2  ^^2  ^3  ;  *  *  * 

i  > 

_|_  ^  _|_  6i  62  63  *  *  * 

where  *’s  indicate  arbitrary  entries. 

Let  (7(61,62,63)  =  E|r»|=4M7£((ei,62,e3).  The  assumption  that  the  result 
of  one  pass  through  the  while  loop  is  an  /(3, 5)-random  instance  implies 
(7(61,62,63)  is  independent  of  the  values  of  61,62,63.  Let  us  examine  what 
would  cause  an  imbalance  between  cr+  =  cr(+,+,+)  and  a-  =  cr(— ,—,—). 
We  need  only  consider  cases  where  ^4,^5,  or  Vq  is  pure.  If  either  or  both 
of  the  Ui, U2, us-clauses  listed  are  obtained  via  E  (on  line  (7))  then  these 
two  ui, U2, 'f^3-clauses  are  obtained  independently  and  so  no  imbalance  could 
occur.  So  we  can  restrict  our  attention  to  the  remaining  case  where  the  two 
Vi,  V2^  U3-clauses  listed  came  from  the  original  D  and  V4,  V5,  or  ve  is  pure.  We 
call  such  a  D  a  potential  threatening  contributor  to  or  (7_,  respectively. 
<7+  and  (7_  have  corresponding  potential  threatening  contributors.  The  only 
difference  is  that  in  such  contributors  to  (7_  Vi,V2,  and  V3  are  not  pure  and 
hence 

Prob(pure  chosen  is  in  {u4,U5,U6})  =  1 

whereas  in  such  contributors  to  (7+,  Vi,V2,  and/or  V3  may  be  pure.  Thus 
to  guarantee  that  (7+  is  as  large  as  <7_  the  weighting  scheme  when  handling 
potential  threatening  contributors  to  <74.  must  give  no  weight  to  any  pure 
literal  in  {ui,U2,U3}.  In  particular  if  the  original  D  is 

vi  V2  V3  ;  Ui  V2  V3  ;  V4  Vc,  ve  ;  V4  V5  v& 

<  > 

+  +  +  +  +  +  +  +  +  +  +  +^ 
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then  Ui,  U2,  and  V3  must  not  be  selected.  Mutatis  mutandi  ^4,^5,  and  vg  must 
not  be  selected.  We  have  arrived  at  a  contradiction. 

It  is  worth  noting  that  a  similar  argument  shows  that  FL'  never  decreases 
the  probability  of  impurity.  More  precisely,  let  V  be  a  set  of  n  Boolean 
variables,  V  a  set  of  n  —  s  Boolean  variables  (where  s  >  0),  C  a  sequence  of 
m  clauses  selected  independently  and  uniformly  from  Qk{V).,  C  a  sequence  of 
m  —  s  clauses  selected  independently  and  uniformly  from  Qk{V').  Let  t  >0. 
Let  E'  be  the  event  that  after  t  complete  passes  through  the  while  loop  of 
line  (1)  of  PL'{C',V',h)  there  is  a  pure  literal  and  let  E  be  the  event  that 
after  s  +  t  complete  passes  through  the  while  loop  of  line  (1)  of  PL'{C,  V,  h) 
there  is  a  pure  literal.  Then  Prob(£')<  Prob(£^'). 

3  Pure  Literal  Rule  for  2-SAT 

Using  a  different  analysis  we  now  show  that  the  pure  literal  heuristic  succeeds 
when  k  =  2  provided  the  ratio  of  the  number  of  variables  over  the  number 
of  clauses  is  asymptotically  greater  than  1. 

THEOREM  3.1 

Let  A  >  1.  Assume  n  >  Xm.  Let  C  be  a  random  sequence  of  m  clauses  from 
Q2{n).  Then 

Prob(the  pure  literal  rule  may  be  applied  to  C  until  at  most 
log2(n)  clauses  remain)  =  1  —  o(l). 

NOTES:  n  is  a  function  of  m. 

Throughout  §3  and  §4  all  asymptotic  notations  are  asymptotic  in  m. 
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REMARK  3.2 

The  proof,  especially  the  Configuration  Lemma  below,  is  based  on  the  Chvatal- 
Reed  [7]  proof  that  under  the  same  hypotheses 

Prob(C'  is  satisfiable)  =  1  —  o(l). 

More  generally  we  show: 

THEOREM  3.3 

Same  hypotheses  as  Theorem  3.1.  If  1  =  o(t),  then 

Prob(the  pure  literal  rule  may  be  applied  to  C  until  at  most 
t  clauses  remain)  =  1  —  o(l). 

NOTE:  i  is  a  function  of  m. 

PROOF.  For  notational  simplicity  we  assume  Am  is  an  integer.  Adding 
clauses  to  C  only  makes  it  harder  to  reduce  to  t  clauses.  So  we  may  as  well 
assume  n  =  Am. 

DEFINITION  3.4  (Pure  literal  block) 

A  pure  literal  block  {PL  block)  is  a  sequence  of  clauses  with  no  pure  literals, 
that  is,  every  variable  which  occurs  does  so  both  positively  and  negatively. 
To  prove  Theorem  3.3  it  sufl&ces  to  prove 

Prob(there  is  a  PL  block  of  size  at  least  t)  =  o(l). 

It  is  well  known  that  2-SAT  has  fast  algorithms  (see  e.g.  Aspvall,  Plass, 
and  Tarjan[l]).  Underlying  this  result  is  the  observation  that  a  2-clause 
/i  V I2  may  be  viewed  as  the  implications  ->li  — >  I2  and/or  -1/2  — ^  h-  Hence,  a 
sequence  of  2-clauses  may  be  viewed  as  a  directed  graph  on  the  set  of  literals. 
From  this  viewpoint  a  PL  block  5  is  a  sequence  of  directed  edges  so  that 
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every  vertex  of  B  is  both  the  initial  vertex  of  a  directed  edge  and  the  terminal 
vertex  of  a  directed  edge. 

DEFINITION  3.5  (Cycle) 

A  cycle  is  a  set  /i  — >■  I2,  k  h-,  -  •  •  Js  h  of  directed  edges.  Henceforth, 
we  write  /i  —>  /2  —>••••—>■  4  /i- 

As  it  has  only  finitely  many  vertices  any  PL  block  includes  a  cycle.  More 
generally  any  vertex  of  a  PL  block  is  either  on  a  cycle  or  lies  between  two 
cycles. 

PROPOSITION  3.6  (CONFIGURATION  LEMMA) 

Same  hypotheses  as  Theorem  3.1. 

a)  Prob(C'  has  two  cycles  connected  by  a  path)=  o(l). 

b)  Prob(C'  has  two  directly  connected  cycles)=  o(l). 

c)  Prob(C'  has  two  overlapping  cycles)=  o(l). 

PROOF. 

a)  Two  cycles  connected  by  a  path  consist  of 

li  ^  k  Ipi  -y  k,  the  first  cycle 

mi  — >■  m2  •  •  •  — mp2  mi,  the  second  cycle 

and 

Ii  —>•  ^2  np3  — )■  mi,  the  connecting  path. 

b)  Two  directly  connected  cycles  consist  of 
k  — ^  k  — ^  ■  ■  ■  — ^  Ipi  — ^  k^  one  cycle 

and 
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/i  m2  mp2  other  cycle, 

c)  Two  overlapping  cycles  consist  of 


h^h-^  — ^  ^/5i+P2  ^1) 

and 

/i  ->■  m2  ->••••  ->•  rup^  ->  Ip^ ,  the  nonoverlapping  part  of  the  other  cycle. 


The  proofs  for  cases  b)  and  c)  are  omitted  as  they  are  similar  to  the  proof 
for  case  a).  The  crucial  point  in  each  case  is  that  the  number  of  literals  used 
is  one  less  than  the  number  of  clauses  used. 

For  the  moment  fix  pi,  p2,  and  ps.  In  case  a)  one  must  choose  p  = 
Pi  +  P2  +  P3  clauses  and  p  —  1  literals  for  these  cycles  and  their  connecting 
path.  The  literals  may  be  chosen  in  at  most  (2n)^“^  ways  and  the  clauses  may 
be  chosen  in  at  most  ways.  Thus,  the  probability  of  this  configuration 
for  fixed  pi,P2,  and  pa  is  at  most 

(2n)P-^m^  _  /  n  y 

(2^(2))^  2nA^Vn  — 1/ 

<  (as  p  <n) 

-2nXp^  ’ 

So  the  probability  of  this  configuration  for  some  pi,p2,  and  pa  with  p  = 

4  P^ 

Pj  -1-^2  +  Pa  is  at  most  — ^ .  Hence,  the  probability  of  this  configuration  for 

2n 

some  P11P2,  and  pa  is  at  most 

2n^A^ 


By  the  ratio  test 

p>i 


is  finite  and  so  the  desired  probability  is 
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The  Configuration  Lemma  tells  that  by  ignoring  a  o(l)  piece  of  the  sample 
space  of  formulae,  we  may  assume  that  no  cycles  in  a  formula  are  connected 
to  one  another.  Thus,  the  proof  of  Theorem  3.3  is  completed  by  showing  ; 
PROPOSITION  3.7 
Same  hypotheses  as  Theorem  3.1. 

Prob(the  sum  of  the  lengths  of  all  cycles  in  C  is  at  least  t  and 
no  cycles  of  C  are  connected  to  one  another)  =  o(l). 

PROOF.  Say  the  sum  of  the  lengths  is  t'  >  t. 

Say  there  are  n,  cycles  of  length  U  for  i  =  l,...,s  with  total  length  f  = 

$ 

E  mU- 

i—l 

As  /  /  is  not  a  clause,  each  ti  >  2. 

One  must  choose  clauses  and  f  literals  for  these  cycles.  The  literals 
may  be  chosen  in  at  most  (2n)*^  ways  and  the  clauses  may  be  chosen  in  at 
most  ways.  But  as  any  cycle  of  length  ti  may  be  rotated  in  ti  ways,  and 
the  rii  cycles  of  length  ti  may  be  permuted  in  n*!  ways,  the  number  of  ways 
to  choose  literals  was  overcounted  by  a  factor  of  fl  ti^'ni\. 

i~X 

So  the  probability  of  Ui  cycles  of  length  for  «  =  1, . . . ,  s  is  at  most 

1  {2ny'm^'  ^  4 

n  (2^(2))*'  ~  ,n 

It  is  well  known  that  in  the  uniform  distribution  on  the  set  of  permutations 
on  t'  letters 

1 

n 

is  the  probability  that  a  permutation  on  t'  letters  has  precisely  n,-  cycles  of 
length  ti  for  i  =  1, . . . ,  s. 
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So  the  probability  that  the  sum  of  lengths  of  all  cycles  is  f  is  at  most 


Prob(a  permutation  on  f  letters  has  no  fixed  point)^  < 


So  the  probability  the  sum  of  the  lengths  of  all  cycles  of  C  is  at  least  t  and 
no  cycles  of  C  are  connected  is  at  most 


t'>t 


So,  as  1  =  o(i),  this  probability  is  o(l). 


Given  the  phenomenal  success  of  the  pure  literal  heuristic  under  the  hy¬ 
potheses  of  Theorem  3.3  it  is  reasonable  to  ask  if  under  the  same  hypotheses 
the  pure  literal  rule  asymptotically  almost  always  eliminates  all  clauses.  This 
is  false  as 

PROPOSITION  3.8 

If  n  ~  Xm  where  A  >  1 ,  then  the  probability  there  is  a  cycle  of  length  2  is 
asymptotically  >  0. 

PROOF.  Let  p2  be  the  probability  there  is  no  cycle  of  length  2.  We  show  p2 
is  asymptotic  to  exp  • 

The  main  source  of  difficulty  in  computing  this  probability  is  that  clauses 
may  occur  more  than  once.  So  we  write  p2  as: 


Prob(there  are  t  repetitions  of  clauses  and  no  cycle  of  length  2) 
<>o 


Pick  f  so  that  1  =  o{t')  and  t'  = 

CLAIM  3.9 

Prob(there  are  at  least  t'  repetitions)  =  o(l). 
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PROOF  OF  CLAIM  3.9.  Say  we  get  exactly  t  repetitions  as  we  list  the  m 
clauses.  Say  these  occur  as  the  •  •  • )  clauses  where  <  *2  <  •  •  •  <  it- 

Then  there  are  ii  —  \  choices  for  the  first  repetition,  *2  “  2  choices  for  the 
second  repetition,. . . ,  and  it  —  t  choices  for  the  repetition.  And  there  are 
at  most  choices  for  the  other  m  —  t  clauses  (where  N  =  n(n  —  1)). 

Thus,  the  probability  of  exactly  t  repetitions  is  at  most  — where 

St,m= 

By  induction  we  have: 

LEMMA  3.10 


(m  —  ty* 

¥ir 


^  ^t,Tn  ^ 


m 


2t 


2H\ 


1  fm'^ 

So  the  probability  of  exactly  t  repetitions  is  at  most  —  I 


.  So  the 


probability  of  at  least  t'  repetitions  is  at  most  ^ 
Thus,  p2  is  asymptotically 


2\t 


,  which  is  o(l)  as 


'Y  Prob(there  are  t  repetitions  of  clauses  and  no  cycle  of  length  2). 
o<t<t' 

As  no  cycle  of  length  2  is  produced  as  we  list  the  m  clauses  if  and  only  if  each 
clause  not  repeating  a  previous  clause  avoids  the  converse  of  each  previous 
clause,  a  similar  argument  shows 


Prob(there  are  t  repetitions  of  clauses  and  no  2  cycle  )  =  st,m 
By  Lemma  3.10  as  t'  =  St,m  is  uniformly  asymptotic  to 


N\ 


2‘A'”(A  -  (m 


2t 


m 

¥i\ 


for  t  <  t' . 


t))V 
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Thus,  p2  is  asymptotic  to 


E 

0<t<t 


m2‘iV! 


,  -  (m  -  t))\' 


By  Stirling’s  formula  and  ln(l  -  x)  =  -x  -  —  +  O(x^),  this  is  asymptotic  to 


^  (W )  ""P 


m 


0<t<t' 


2N 


By  the  power  series  expansion  of  exp(x),  this  is  asymptotic  to 


\  I  m 
exp  1  ^  )  exp  (  - 


2N, 


which  is  asymptotic  to  exp 


1 


Proposition  3.8  is  in  striking  contrast  to  Broder,  Frieze,  and  Upfal’s  [2] 

results  for  the  pure  literal  rule  for  3-clauses.  They  show  that  in  their  model 

m 

with  asymptotically  1  probability  there  is  no  pure  literal  block  provided  — 
is  sufficiently  small  (<  1.63). 

Proposition  3.8  and  Erdos  and  Renyi’s  [9]  results  on  the  occurrences  of 
cycles  in  random  graphs  suggest  the  following  conjectures,  assuming  Aim  < 
n  <  X2m  where  the  Aj’s  are  constants  >  1. 

CONJECTURE  3.11 

For  any  fixed  t  the  asymptotic  probability  there  is  a  PT  block  of  size  t  is 
strictly  between  0  and  1. 

CONJECTURE  3.12 

The  asymptotic  probability  there  is  a  PL  block  of  some  size  is  strictly  between 
0  and  1.  (By  the  Configuration  Lemma,  Conjecture  3.12  is  equivalent  to:  The 
asymptotic  probability  there  is  a  cycle  of  some  length  is  strictly  between  0 
and  1.) 
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4  Optimality  of  Theorem  3.1 


We  show  Theorem  3.1  is  near  optimal  by  showing  if  n  =  Am  where  0  <  A  <  1, 
then  with  asymptotic  probability  1  there  is  a  PL  block  of  size  nf  for  some 
£  >  0.  This  follows  immediately  from  the  proof  in  Chvatal  and  Reed  [7] 
that  for  such  A,  Prob(C'  is  satisfiable)=  o(l).  They  use  the  second  moment 
method  to  show  Prob(C'  has  a  “snake”  of  size  m^)=  1  —  o(l)  for  any  £  <  r- 

O 

It  is  trivial  to  observe  a  snake  is  a  PL  block. 

A  larger  e  with  a  simpler  proof  may  be  obtained  by  using  cycles  instead 
of  snakes. 

THEOREM  4.1 

Let  0  <  A  <  1.  Assume  n  <  Am.  Let  s  =  o(n^/^).  Let  C  be  a  random 
sequence  of  m  clauses  from  Then 

Prob(the  pure  literal  rule  may  be  applied  to  C  until  fewer  than 
s  clauses  remain)  =  o(l). 

NOTE:  s  is  a  function  of  m.  It  suffices  to  prove  this  theorem  for  1  =  o(s). 
PROOF.  As  in  Theorem  3.3  we  may  as  well  assume  that  Am  is  an  integer 
and  n  =  Am. 

DEFINITION  4.2  (s-cycle) 

An  s-cyclc  is  a  cycle  li  — ^  I2  — ^  ■  ■  ■  — ^  L  — ^  ^1  iii  which  /i, . . . ,  /^  have  distinct 
variables. 

We  show  Prob(C  has  an  s-cycle)=  1  —  o(l). 

For  any  s-cycle  A  let 


{1  if  each  clause  of  A  occurs  exactly  once  in  each  C 
0  otherwise 
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Let  X  =  E{xa  I  a  is  an  s-cycle}.  We  show  with  probability  1  —  o(l)  some 
s-cycle  occurs  by  showing  Prob(x  >  0)=  1  -  o(l).  This  is  accomplished  by 
the  second  moment  method.  That  is,  we  show 

E(x")  =  E{xf{l  +  o(l)). 

Then  by  Chebyshev’s  inequality, 

Probdx  -  £(x)|  >  Eix))  <  =  “(1) 

and  so  Prob(x  =  0)=  o(l). 

As  in  [7], 

EixA)  =  (1  +  o(l))  uniformly  in  A. 

There  are  =  (2n)*(l  +  o(l))  ways  to  choose  the  vertices  of  an 

5-cycle.  So 

E(x)  =  Q)‘ a + om 

Also  if  A  and  B  are  s-cycles  sharing  exactly  i  edges,  then 

/  \  25— t 

E{xaXb)  =  (  ^ j  (1  +  ®(1))  uniformly  in  A,  B,  and  i. 

Thus 

E{xaXb)  =  (  ~  1  E{xa)E{xb){1  +  o(l))  uniformly  in  A,  B,  and  t. 

Let  pi(n)  (usually  written  p,)  be  the  probability  that  a  random  s-cycle  B 
shares  i  edges  with  a  fixed  s-cycle  A. 

So 

£(x^)=i:p<(^)'-E(x’)(i+o(i))' 
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The  main  work  involves  obtaining  estimates  of  the  pi.  As  there  are 
(2n)*(l  -f  o(l))  s-cycles, 

number  of  s-cycles  B  such  that  A  and  B  share  i  edges 
(2n)-(l  +  0(1))  ■ 

Let  An  B  denote  the  edges  in  both  A  and  B.  Let  k  be  the  number  of 
connected  components  oi  A  C\  B.  Let  N{i,  k)  be  the  number  of  s-cycles  B 
such  that  An  B  has  i  edges  and  k  components. 

So  for  i  >  0 

Efc>i  k) 

(2n)«(l  +  o(l))- 

For  0  <  i  <  s,  N{i,k)  may  be  overestimated  by  the  product  of 

i)  the  number  of  ways  of  placing  in  A  the  k  components  of  An  B 

ii)  the  number  of  ways  of  placing  in  B  the  k  components  of  An  B 

iii)  the  number  of  ways  of  placing  in  B  the  edges  for  each  component  of 

AnB 

and 

iv)  the  number  of  ways  of  assigning  literals  to  the  vertices  in  B,  but  not 
in  An  5. 


i)  and  ii)  may  be  overestimated  as  follows:  Mark  the  beginning  of  each 

component  with  a  -|-  and  the  end  with  a  — .  So  we  must  choose  positions  for 

/«\ 

the  2k  markers  giving  at  most 


< 


.2k  -  {2k)\ 


choices. 


iii)  is  the  number  of  ways  of  permuting  same  size  components  of  A  n  jB 
in  B.  This  is  largest  when  all  the  components  are  the  same  size  and,  hence, 
is  at  most  k\. 
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As  i  <  s,  AC\  B  has  i  +  k  vertices  and  hence  B  has  s  —  (i  +  k)  vertices 
not  in  An  5.  So  iv)  is  at  most  Thus, 


So 


(2Jt  \  2 

W(2n)-C+‘)  < 


1  ^  .  1  5^  1 


-  (2n)^- (2n)^  '  (2n)«  2n  ^ 


(2n)*+*^ 

(1+0(1)). 


2n 


As  5  = 


1 


Pi  <  - — ^ — (1  +  o(l))  uniformly  in  i  for  0  <i  <  s. 
[2ny  2n 

If  2  =  s,  the  only  choice  for  B  is  which  vertex  of  A  is  if . 


So 


So 


So 


So 


Ps 


(2n) 


-(1  +  o(l). 


^2Pi  ^ 

i=l 


+  (1  + »(!))  = ‘’(D- 


(2ra)*  2n^(2n) 

Po  =  l-  o(l) 


2n 


2\  * 


Eix'^)  -  E{xy  =  EaW  —1  £;(x)^(l  +  0(1)). 


t=i 


By  the  above  estimates  of  pi 


B(x^)  -  E(xr  <  A  E  (£)'  E(xm  +  0(1))  +  O  (^)’  EixYil  +  0(1)). 

t—1 


But  —  E  =  <>(1)  as  —  =  A  <  1  and  s  =  o{n^^*)  ;  and  5  (  — )  = 

2n  ^  \mJ  ^  ’  m  \mj 


2n  ^  \m 

Z=1 

71 

o(l)  as  —  =  A  <  1  and  1  =  o(s). 
m 
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QUESTION  4.3 

How  much  larger  of  a  pure  literal  block  occurs  asymptotically  almost  always 
for  n  =  Am  where  0  <  A  <  1? 

5  Topics  for  further  study 

For  2-clauses  we  have  seen  there  is  an  abrupt  transition  in  the  performance 
of  the  pure  literal  heuristic.  It  occurs  for  n  =  Am  as  A  switches  from  <  1 
to  >  1.  It  is  striking  that  this  is  the  same  A  at  which  occurs  the  abrupt 
transition  in  2-satisfiability  shown  by  Chvatal  and  Reed  in  [7]  and  Goerdt  in 
[15].  Is  there  a  comparable  transition  in  the  performance  of  the  pure  literal 
heuristic  for  A;-clauses  for  fc  >  3  ?  First  one  should  find  an  a  (depending  on 
k)  such  that  for  n  >  am,  asymptotically  there  are  almost  never  pure  literal 
blocks  of  size  at  least  log(m).  And  one  should  find  a  /?  (depending  on  k)  such 
that  for  n  <  /3m,  asymptotically  there  are  almost  always  pure  literal  blocks 
of  size  m®  for  some  e  >  0.  Next  one  should  determine  if  as  for  k  =  2,  the 
inf  of  the  possible  a’s  =  the  sup  of  the  possible  /3’s.  We  are  confident  that 
unlike  for  2-clauses  such  a  transition  would  be  larger  than  the  conjectured 
transition  for  satisfiability. 
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